01C0 TEAM 5. 

DATE: 10/28/90 
INPUT SET: S13439.mw 



I This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



SEQUENCE LISTING 
j General information: ® f* 'j^ 

TITLE 0 P iromo.. MuUival.nt Vaccine K . Clostridia 
Botulinum Neurotoxin 
(iii) NUMBER OF SEQUENCES: 82 

( iv) CORRESPONDENCE ADDRESS: 

/ A^ ADDRESSEE: Medlen & Carroll 
5 StSeT: 220 Montgomery Street, Suite 2200 
(C) CITY: San Francisco 
in\ STATE: California 
( ( E) COUNTRY: United States of America 
(F) ZIP : 94104\, 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
) -n \ COMPUTER: IBM PC compatible 

S C OPKB«I»0 f „ ™, PC-DOS/MS - O ^ 
(D) SOFTWARE: Patentln Release hi.m, 

ivi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(Viii) ATTORNEY/ AGENT INFORMATION: 
(A) NAME: Ingolia, Diane E. 
)u\ PFrtTSTRATION NUMBER : 40,027 
( ( c! rIfScE/DOCKET NUMBER: OPHD-02304 

(ix) TELECOMMUNICATION ^FORMATION: 
1 (A) TELEPHONE: (415) 705-8410 

■(B) TELEFAX: (415) 397-8338 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pair 

(B) TYPE: nucleic acid 



DATE: 10/28/96 

FA 1 input SET: S13439.raw 

IC) STRANDEDNESS: single 
J 7 J D) T oPOLOGY: linear 

50 (ii) MOLECULE TYPE: DNA (genomic) 
51 

11 (Xi) SEQUENCE DESCRIPTION: SEQ ID HO.l. 

Is GGAAATTTAG CTGCAGCATC TGAC 

II (2) INFORMATION FOR SEQ ID NO: 2: 

11 <i) SEQUENCE CHARACTERISTICS: 

60 U) (A) LENGTH: 24 base pairs 

61 B type: nucleic acid 

62 ' S T R ANDEDNESS: single 
|3 | D) TOPOLOGY: linear 

H (ii) MOLECULE TYPE: DNA (genomic) 
67 

To (xi) SEQUENCE DESCRIPTION : SEQ ID NO:2: 

72 TCTAGCAAAT TCGCTTGTGT TGAA 

]l (2 ) INFORMATION FOR SEQ ID NO:3: 

11 (i) SEQUENCE CHARACTERISTICS: 

76 ( ' (A) LENGTH: 20 base pairs 

77 B T YPE: nucleic acid 

7 8 ' S T R ANDEDNESS: single 

79 m TOPOLOGY: linear 

80 K ' 

l\ (ii) MOLECULE TYPE: DNA (genomic) 



(Ki) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

si CTCGCATATA GCATTAGACC 

90 (2) INFORMATION FOR SEQ ID NO: 4: 

l\ (i) SEQUENCE CHARACTERISTICS: 

92 (A) LENGTH: 19 base pairs 

H B) TYPE : nucleic acid 

94 c STRANDEDNESS: single 

9 | ; D) TOPOLOGY: linear 

% (ii) MOLECULE TYPE: DNA (genomic) 



DATE; 10/28/96 
TIME: 17:40:08 



INPUT SET: S13439.raw 



100 
101 
102 
103 
104 
105 
106 
107 
108 
109 
110 
111 
112 
113 
114 
115 
116 
117 
118 
119 
120 
121 
122 
123 
124 
125 
126 
127 
128 
129 
130 
131 
132 
133 
134 
135 
136 
137 
138 
139 
140 
141 
142 
143 
144 
145 
146 
147 
148 
149 
150 
151 
152 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CTATCTAGGC CTAAAGTAT 
(2 ) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
( ' (A) LENGTH : 8133 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ii) MOLECULE TYPE : DNA (genomic) 



(ix) FEATURE: 

1 (A) NAME /KEY : CDS 

(B) LOCATION: 1..8130 

(xi ) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 
ATC TCT TTA ATA TCT AAA GAA TTA ATA AAA 

Met Ser Leu He Ser Lys ^ ^ 

££££££ S ESSE 

100 1U 



CTC GCA TAT AGC ATT 
Leu Ala Tyr Ser He 
15 

ACT AAT TTA GAC GAA 
Thr Asn Leu Asp Glu 
30 

AAA TAT TTG CAA TTA 
Lys Tyr Leu Gin Leu 
45 

AAT AAA TAT AAA ACT 
Asn Lys Tyr Lys Thr 
60 

AAA GAT ATA TTA AAA 
Lys Asp He Leu Lys 
80 

CCT GTA GAA AAA AAT 
Pro Val Glu Lys Asn 
95 

GAT ATT GCT CTT GAA 
Asp He Ala Leu Glu 
110 

TAT AAT ATT AAA CTG 




PAGE: 4 



DATE: 10/28/96 
TIME: 17:40:11 



153 
154 
155 
156 
157 
158 
159 
160 
161 
162 
163 
164 
165 
166 
167 
168 
169 
170 
171 
172 
173 
174 
175 
176 
177 
178 
179 
180 
181 
182 
183 
184 
185 
186 
187 



Tyr He Lys Gin Trp 
115 



Ala Asp 



He Asn 
120 



Ala Glu Tyr Asn 
125 



TGG TAT 
Trp Tyr 
130 

GTT GAA 
Val Glu 
145 

CAA AAT 
Gin Asn 



TTT ATA 
Phe He 



ATC AAT 
He Asn 



GAT AGT GAA 
Asp Ser Glu 

TCT TCT ACC 
Ser Ser Thr 



CCT CAA TTT 
Pro Gin Phe 
165 



GCA TTC 
Ala Phe 
135 

ACT GAA 
Thr Glu 
150 

GAT AAT 
Asp Asn 



TTA GTA 
Leu Val 



AAT ACA 
Asn Thr 



GCA TTA 
Ala Leu 



ATG AAA 
Met Lys 



CTA GTA 
Leu Val 
210 



TAT GAT AGA 
Tyr Asp Arg 
180 

AAA CCT ACA 
Lys Pro Thr 
195 

TCT GAA TAT 
Ser Glu Tyr 



CAA AAA AGG TTT 
Gin Lys Arg Phe 
185 

GTA CCT ACA ATA 
Val Pro Thr He 
200 

AAT AGA GAT GAA 
Asn Arg Asp Glu 
215 



CAG CTA 
Gin Leu 
155 

TTT TAC 
Phe Tyr 
170 

ATA AAT 
He Asn 



CTA AAA 
Leu Lys 
140 

CTA GAG 
Leu Glu 



INPUT SET: S13439.raw 
He Lys Leu 

AAG GCT ATA 432 
Lys Ala He 



AAA AAA 
Lys Lys 



GAA GAG ATT 
Glu Glu He 
160 

AGG ATG GAA 
Arg Met Glu 
175 



TAT TAT 
Tyr Tyr 



GAT GAT 
Asp Asp 



ACT GTA 
Thr Val 



ATT ATA 
He He 
205 

TTA GAA 
Leu Glu 
220 



AAA TCT CAA 
Lys Ser Gin 
190 

AAG TCT CAT 
Lys Ser His 

TCA TAT AGA 
Ser Tyr Arg 



ACA AAT 
Thr Asn 
225 

GCT AAT 
Ala Asn 



TCT TTG AGA 
Ser Leu Arg 



189 

190 

191 

192 

193 

194 

195 

196 

197 

198 

199 

200 

201 

202 

203 

204 

205 



GAG TTG 
Glu Leu 



AGT TTG TTT 
Ser Leu Phe 
245 

TTA AAT CGT 
Leu Asn Arg 
260 



AAA ATA AAT AGT 
Lys He Asn Ser 
230 

ACA GAA CAA GAG 
Thr Glu Gin Glu 



GGA AAT TTA GCT 
Gly Asn Leu Ala 
265 



AAT CAT 
Asn His 
235 

TTA TTA 
Leu Leu 
250 

GCA GCA 
Ala Ala 



GGG ATA 
Gly He 



GAT ATC AGG 
Asp He Arg 
240 



AAT ATT 
Asn He 



TCT GAC 
Ser Asp 



TTA TTA 
Leu Leu 

CTT CCA 
Leu Pro 
290 

TCT ATT 
Ser He 
305 

AAG TAT 
Lys Tyr 



GCC CTA AAA 
Ala Leu Lys 
275 

GGT ATT CAC 
Gly He His 

GGA CTA GAC 
Gly Leu Asp 



AAT TTT GGC GGA 
Asn Phe Gly Gly 
280 

TCT GAT TTA TTT 
Ser Asp Leu Phe 
295 

CGT TGG GAA ATG 
Arg Trp Glu Met 
310 



GTA TAT 
Val Tyr 



TTA GAT 
Leu Asp 
285 



TAT AGT CAG 
Tyr Ser Gin 
255 

ATA GTA AGA 
He Val Arg 
270 

GTT GAT ATG 
Val Asp Met 



AAA ACA 
Lys Thr 



AAA AAA TAT 
Lys Lys Tyr 



ATA AAT AAT TAT 
He Asn Asn Tyr 



ATA AAA 
He Lys 
315 

ACA TCA 
Thr Ser 



ATA TCT 
He Ser 
300 

TTA GAG 
Leu Glu 



AGA CCT AGC 
Arg Pro Ser 



GAA AAC 
Glu Asn 



GCT ATT ATG 
Ala He Met 
320 

TTT GAT AAA 
Phe Asp Lys 



DATE: 10/28/96 
™E: 17:40,4 

INPUT SET: S13439.mw 

325 

206 J/ , ftc , 

... „ a-ra na& AGT AAA 1056 



207 m[pA ... raT AAT TTT AAA CTC ATT ATA GAA AGT AAA 

in iz z si - ™ £ " - ss Lys " u Ile Ile - ser 

210 340 

£ £ si = k s s i s: = °" - £ - ~ - 

214 355 360 

III OAT CTT OAA ATT »» « TTC OCT TTA «= -J »» - CAA 

217 Asp Leu Glu He Lys He Ala Phe A±a ^ 

218 370 375 

219 p . , nnT TCA TAT CTT ACT AAC CTA GTA ATA GAA 

III HI T,l Si - £ Si " - Sr ™ ». - v.i xi. -j- 

222 385 390 

i si ss £ s s s - - = S - ~ s - £ ila 

226 405 

HI ATA - TCT AAT AAC TTC ACA «T - ACT AAA ATT TTT CAT CAT 

229 He Glu Ser Asp Asn Asn Pne inr « f 430 

230 420 

231 n» nar TCT ATG TTT TTA AC A AAA 
Ml TCA TTA TTT AAT TCA OCT ACC CCA OAA AAC TCT £ ^ ^ ^ ^ 

233 Ser Leu Phe Asn Ser Ala inr ^ 445 

234 435 

235 m , n>1 rrT TTT A TG CCA GAA GCT CGC TCC AC A 
I',*, Si S Pro £ - °" £ S5 S Met Pro Olu Ala Ar g ser Thr 

238 450 455 

239 , ns r( ,, GCT TAT GCG TCA GCT TAC TAT GAT TTC 
l\l Si SE ™ sZ Z SJ S Z Al, Ser A l3 TyI TyI A SP Ph. 

242 465 470 

243 » . m arT *TA GAA aaa ACT TTA AAA GCA TCA GAT 
\% S £ £ Si JS 55 Si - ,js THr «u L y s AU Ser A SP 

246 485 

S » s Si s - = £ Si S i= 2 = Si = S Si 

I S i Si Si S = s 5 s s ss Si S £ Si Si s 

1 si s s: - S S 5 £ " s ST S E S = Si 



1104 



1152 



1200 



1248 



258 530 



